
Data

The Titanic Survival dataset is simple. It describes each passenger with personal details (name, gender, age, family) and travel details (class, cabin, port of embarkation, ticket fare); these are the input features. Whether the passenger survived is the target feature.

There are 10 input features, with 891 samples (passengers) for training and 418 samples for testing/evaluation.

There are three passenger classes and three ports of embarkation. The Age and Cabin features contain missing values, and Embarked has 2 missing values.
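One way to check these properties is to count missing values per column and the number of distinct values in a column. The sketch below assumes pandas and uses a tiny made-up frame that only borrows the Titanic column names; the helper `summarize_missing` and the `toy` data are hypothetical and are not the real dataset.

```python
import pandas as pd


def summarize_missing(df: pd.DataFrame) -> pd.Series:
    """Count missing values per column, keeping only columns that have gaps."""
    counts = df.isna().sum()
    return counts[counts > 0].sort_values(ascending=False)


# Illustrative rows only (NOT real Titanic records); column names follow the dataset.
toy = pd.DataFrame({
    "Pclass": [1, 3, 2, 3],
    "Age": [38.0, None, 27.0, None],
    "Cabin": ["C85", None, None, None],
    "Embarked": ["C", "S", None, "Q"],
})

missing = summarize_missing(toy)
print(missing)

# nunique() ignores missing values by default, so it counts distinct known values.
print("passenger classes:", toy["Pclass"].nunique())
print("ports of embarkation:", toy["Embarked"].nunique())
```

Applied to the actual training data, the same calls should show gaps in Age, Cabin and Embarked.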

Name, Cabin and Embarked are categorical features.
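As a rough sketch of how such features can be spotted, the snippet below selects the non-numeric columns of a frame and converts one of them to pandas' `category` dtype. It assumes pandas; the `toy` frame is made up for illustration and only reuses the dataset's column names.

```python
import pandas as pd

# Illustrative rows only (NOT real Titanic records).
toy = pd.DataFrame({
    "Name": ["Passenger A", "Passenger B", "Passenger C"],
    "Age": [22.0, 35.0, 58.0],
    "Fare": [7.25, 53.10, 26.55],
    "Cabin": ["C85", None, "E46"],
    "Embarked": ["S", "C", "Q"],
})

# Columns that are not numeric are candidates for categorical handling.
non_numeric = toy.select_dtypes(exclude="number").columns
print(list(non_numeric))

# Converting to the category dtype exposes the distinct levels of a feature.
embarked = toy["Embarked"].astype("category")
print(list(embarked.cat.categories))
```

Whether a column such as Name is best treated as a category or as free text is a modelling choice; the dtype check only narrows down the candidates.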